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Abstract 

The Starobinsky model of inflation, consistent with Planck 2015, has a peculiar form of 
the action, which contains the leading Einstein term R, the term with a huge coefficient, 
and negligible higher-order terms. We propose an explanation of this form based on compact- 
ification of extra dimensions. Once tuning of order 10“'* is accepted to suppress the linear 
term R, we no longer have to suppress higher-order terms, which give nontrivial corrections 
to the Starobinsky model. We show our predictions of the spectral index, its runnings, and 
the tensor-to-scalar ratio. Finally, we discuss a possibility that quantum gravity may appear 
at the scale A > 5 x 10*® GeV. 


1 Introduction 


The precise cosmic microwave background (CMB) observations favor the plateau-type inflaton 
potentials. Although the combined analysis of BICEP-Keck-Array-Planck resulted in a finite 
value of the tensor-to-scalar ratio, r = O.OdSlggg^ [I], Planck 2015 itself has not found any 
evidence of detecting it but obtained an upper bound, r < 0.103 (Planck TT -|- lowP) [2]. In 
fact, combining these results in a stringent limit, we get r < 0.08 (Planck TT-|-lowP-|-BKP) [2]. 
There are many models, including the Starobinsky model [3], the Higgs inflation model [3], and 
cosmological attractors (see Ref. [5] and references therein), whose predictions are at the center 
of the Planck constraint. Among other features, the Starobinsky model of inflation specifically 
does not require introduction of an inflaton field by hand: the inflaton degree of freedom emerges 
from a higher-order gravitation term. 

The well-known form of the mode0 is [7] 

S = + (1) 

where Mp ~ 2.4 x 10^® GeV is the reduced Planck mass, R is the Ricci scalar, and m is a 
mass-dimensional parameter (actually the inflaton mass). In the IR limit, R <C m?, it reduces 
to the General Relativity (with the cosmological constant, which should be fine-tuned to be a 
small number and hence we ignore it here), which is well established in wide scales. On the other 
hand, when R becomes comparable with m^, the second term becomes important. Introducing an 
auxiliary scalar field and applying Weyl transformation and scalar field redefinition, the model 
is recast in the form of Einstein gravity with a canonically normalized scalar field cf) with the 
following scalar potential HHIO]: 

Rstarobinsky = (l - ' , (2) 

where m is interpreted as the inflaton mass at the vacuum. 

If one interprets in Eq. m as the expansion parameter of the theory, there are no reasons 
to expect absence of even higher-order terms like R^ and (aside from terms involving Ricci and 
Riemann tensors and derivatives, which we neglect because they generically introduce negative 
norm states (ghosts) [12]) with negative powers of m^. Such higher-order terms are extensively 
discussed in non-supersymmetric [131t21| as well as supergravity theories [221127] . That is, Eq. © 
should be augmented by higher-order terms as follows: 



(3) 


^ The original formulation involves all quadratic curvature invariants such as R^''Rfiv and Rfiupa ■ In 

conformally flat space-time, the effects of these terms are represented by the scalar curvature term as in Eq. ©• 
Note also that the de Sitter expansion in f{R) gravity was discussed in Ref. [6]. 
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where a 2 = 1/12, and with (n > 3) naively expected to be of order 1. These terms, however, 
easily spoil the success of the inflationary model by substantially modifying the inflaton poten¬ 
tial ©• So in any way, the higher-order terms must be sufficiently suppressed to maintain the 
predictions of the model. 

If the higher-order terms involving negative powers of are suppressed by phenomenological 
reasons, what is the scale of the suppression? The “next-to-natural” expectation would be that 
it is the reduced Planck scale Mp, since there are no other scales in the theory. In this case, the 
action is expanded by the Planck scale Mp with order 1 coefficients, but then the coefficient of 
the second term must be somewhat large (02 — 5 x 10®). This is the well-known peculiarity 
of the Starobinsky model, which we try to partially explain here. 

In this paper, we take a view that the large coefficient of the term is actually an overall 
coefficient of the action. As we will see, such a large overall factor naturally emerges in theories 
with extra space-time dimensions. (See Refs. [28H32j and references therein for previous works 
on extensions of the Starobinsky model in extra dimensions.) Although we have to suppress 
the coefficient of the linear term R by tuning of order 10“^, we do not have to additionally 
suppress the higher-order terms. Moreover, it leads to a Starobinsky-like model with interesting 
observational consequences. At the end, we predict inflationary observables, and obtain the lower 
bound on the fundamental scale of the underlying higher-dimensional theory. 

2 Starobinsky-like model from extra dimensions 

Suppose that the underlying gravitational theory lives in D space-time dimensions with a char¬ 
acteristic energy scale A. Its effective action is described by 

S = A^ f d^xyj-go 

where bn are dimensionless coefficients, go is the determinant of the D-dimensional metric, and 
Rd is the D-dimensional Ricci scalar. Here we require the absence of ghosts and higher derivative 
interactions. We also require that the low-energy modes of our effective field theory are the 
graviton and the scalaron. Hence we assume the /(i?)-type theory (see e.g. Refs. [331135] for 
reviews of f{R) gravity). Also, we neglect possible nonminimal couplings with matter fields 
for simplicity. Assuming b 2 > 0, we may set 62 = 1 by redefinition of the scale A or by Weyl 
transformation. It should be stressed that this is not tuning but just a matter of convention. 
(The 62 -dependence can be easily reproduced by replacing bi by 61/62 after Eq. ([ 6 ]).) Upon the 
compactification to four dimensions, the action becomes 



2 



where c = Vd- 4 :^^~^ is the overall dimensionless factor, Vd-a is the volume of the compactified 
extra dimensions, and R is the four-dimensional Ricci scalaiQ. For example, if we take L) = 10 
(c/. superstring theory) and the compactification radius L = , which satisfies L ~ 30/A, we 

can naturally obtain a large overall factor c ~ 5 x 10®. Note that if we take a larger number 
of dimensions, it becomes easier to obtain huge overall factor c with the compactification radius 
being the same order as A“^. 

Basically, all the coefficients bn are expected to be order 1, but bo should be fine-tuned to 
suppress the cosmological constant. Furthermore, we require that bi also happens to be very 
small. Otherwise, the situation is similar to Eq. ([3]) with m replaced by A. Since the limit 6 i —>• 0 
does not enhance symmetries of the underlying theory, it is regarded as tuning. (However, in the 
perspective of the compactified four-dimensional theory, an approximate scale symmetry appears 
in this limit. See more discussion in Sect. [H) The tuning is to be done so that the renormalized 
value of bi becomes small. To reproduce the Starobinsky model, we set 


cbiJS? 


Afg 
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with | 6 i| <C 1. Then Eq. ([5]) becomes 


Mg 

12m2 


~ 5 X 10®, 


S = 



Mg 

12m2 



( 6 ) 

(7) 


so the suppression scales for n > 3 become larger than the inflation scale m as we tune bi to be 
small. This is the form of the action we advocate. 

The extension of the Starobinsky model with an term was studied in Ref. which gives 
us a constraint on each coefficient. 


nbn 



^ j^g-2n-|-2.6 


(n > 3). 


( 8 ) 


The constraint for n = 3 is | 6 i| < 10“®'® up to an order 1 factor I 63 I. With |5i| satisfying 
this bound, the constraint dSD is also satisfied for n > 4. Using the results of Planck 2015, 
Ug = 0.9655 ± 0.0062 (Planck TT -|- low P) [2], we obtain more stringent 95% confidence-level 
bounds on bi: 


2.5 X 10“"^ < ft < 1-3 X 10“"^ 

II 

01 

(9) 

1.4 X 10“"^ < 6 < 2.2 X 10““^ 

II 

( 10 ) 


^ The above compactification assumes flat extra dimensions so that Rn ~ R- For generic extra dimensions, we 
have Rd = R+0{1/L^) where L is the typical size of extra dimensions (compactification radius). Thus, coefficients 
bn receive only corrections like b„ ^ bn + O ((l/L^A^)). Since we take LA large, these corrections are neglected. 
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where b = 6361 . Note that bi < 0, and 63 is expected to be of order 1, so the bound on bi is 
roughly | 6 i| < 2 x 10 ““^ §| 

Let us compare how the situation has been improved. The coefficients of the Starobinsky- 
like models in effective field theory ([3]) should obey an constraint analogous to the inequality ([ 8 |) 
with 1 61 1 replaced by an order one number. The first higher-order coefficient 03 is then constrained 
as severely as b, i.e., |a 3 | < 0(10“^). However, the required tuning becomes increasingly more 
sever in the large n limit, and an inhnite tuning is required with n ^ oo0 In our proposal, only 
one parameter tuning of bi is needed, and its precision level is that of the least severe term in 
the original framework. 

3 Implications for inflationary observables 

For self-completeness and with the latest Planck data, let us discuss the effects of the extra terms 
in the action on the inflationary observables, namely the scalar spectral index Ug, its running Og, 
and the tensor-to-scalar ratio r. As we saw above, effects of higher-order (n > 4) terms are more 
suppressed than the n = 3 term, so we neglect the higher-order terms in the following analyses. 
The addition of the term has been studied since the early days [II1E6], but recent discussions 
include Refs. [141120] . Here, we summarize the properties of the scalar potential in the Einstein 
frame and inflationary observables, and compare with the latest observational data. Considering 
observation of the 21 cm line from hydrogen atoms, we obtain the opposite conclusion to that in 
the literature CD- 

Under the standard procedure, the Jordan frame action ([7]) up to the third term is transformed 
into the Einstein frame action with the following potential for a canonical scalar field (j), 



( 11 ) 


Here and hereafter, we take the reduced Planck unit Mp = 2.4 x 10^®GeV = 1. When we set 
6 = 0, it reduces to the Starobinsky potential ([2]). If 6 is negative, the potential blows up in the 
large field region. If b is positive, the potential has a runaway behavior, and there is the possibility 
of topological inflation as discussed in Ref. m- If we retain b up to the leading nontrivial order, 
the potential is 

U = Ustarobinsky X (^1 - (l - + 0{b^), (12) 

® One can soften the tuning of bi by an order of magnitude by tuning 63 instead while keeping the value of b. 
Further increase of |&i| makes the total tuning worse, which may be defined as the product of all \b„\ less than one. 
If we accept |6„| as small as the combinatorial factor 1/n!, |6i| can be increased one more order of magnitude. 

^ It may be possible for some higher-order terms to conspire to cancel major effects on the inflationary potential, 
but we do not discuss this case for simplicity. 
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where Vstarobinsky is given in Eq. ([2]). Of course, the large field behavior {(j) —>■ oo) depends also 
on the higher-order terms 1151120] . but the leading order is enough for our purpose. 

In the leading order of the deformation parameter b, the spectral index Us, the tensor-to-scalar 
ratio r, the running of the spectral index Og, and its running /3s are obtained as 


1 - Us ~ 




N 

12 

Jp 


16, 


27 


32, 


1 + —hN^ = 


N 27 


. 16, 12 64, 

~ 27^^ ) “ iV2 “ 9 


Os ^ - 


iV 2 


, 16,,,2A 2 32^ 

27&1V j - ^2 + 27^’ 


ATS 


/3s:^-^ l + -6iV =-1^7^- 


16 


ATS 9Ar2 


h, 


(13) 

(14) 

(15) 

(16) 


where higher-order terms in 1/N are also neglected. The results for ns,r, and Os are consistent 
with the n = 3 case in Ref. m, and we additionally obtain the expression for /3s. 

Varying the value of the parameter b, we obtain a prediction of the model as curves in the 
(ns,r)-plane in Fig. [TJ The region of positive b corresponds to the left of the large point (the 



spectral index ris 


Figure 1: Prediction of the model in the (ns,r)-plane as we vary the parameter b = 6361 . 
The blue (top) line and red (bottom) line correspond to Ne = 50 and Ng = 60, respec¬ 
tively. The large dots show the prediction of the Starobinsky model (6 = 0), and the small 
dots correspond to points of 6 = n x 10~^ with integer n. The green contours are the Planck 
TT-|-lowP-|-BKP-|-lensing-|-BAO-|-JLA-|-77o constraints (traced from Fig. 21 in Ref. [38]). 


Starobinsky model, 6 = 0), and negative to the right. This is because the positive 6 makes the 
potential flatter (e smaller) and more curved {\rj\ larger), so both Ug and r are smaller. The 
constraint on Ug gives a constraint on 6 as the inequalities (l9|) and m- The correction from 
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the extra term does not drastically change the value of r to improve the detection prospect of r. 
However, the future prospect of precision of the spectral index Ug by the 21 cm line and CMB 
observation will be 5 x 10“^^ |39j . so it will become possible to distinguish the Starobinsky model 
{b = 0 ) and our extension if b is at least of order 10 “®. 

The prediction of the model in the (ng, as)-plane is shown in Fig. [2j As can be seen from the 



spectral index Uj 


Figure 2; Prediction of the model in the (ng, as)-plaiie as we vary the parameter b = 6361 . The 
blue (bottom) line and red (top) line correspond to Ne = 50 and Ne = 60, respectively. The large 
dots show the prediction of the Starobinsky model (6 = 0), and the small dots correspond to the 
points of 6 = n X 10“^ with integer n. The green contours are the Planck TT, TE, EE + low P 
constraints (traced from Fig. 4 in Ref. [2]). 


figure and Eq. m, the positive 6 makes the absolute value of the running smaller. The running 
in the model (jlip with the rising correction (6 < 0) was discussed in Ref. [20] in the context of 
power suppression in low multipolesH The running Og will be also measured by the 21 cm line 
and CMB observation at the precision of 3 x 10“^ [39|, and this has sensitivity on our deformation 
parameter 6 of order 10“^. This information combined with that of Ug will give us a test of our 
model. 

On the other hand, the running of running /3g is of order 10“^ in our case, and it cannot be 
measured by the near future observations [39]. In other words, our predictions can be falsified 
by detection of the running of running /3s. 

Let us discuss the uncertainties due to the reheating epoch in our predictions. The uncertainty 
of the Ng affects Ug in particular, see Eigs. [T]and[ 2 j We estimate it for the original Starobinsky 
model as = 56.0(1) where we have used the expression of in Ref. [43j and the inflaton decay 

® It is interesting to note that such steepening arises also in the (old-minimal) supergravity embedding of the 
Starobinsky model for some initial conditions | 401442 |. 
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rate in Ref. [H], we have assumed the equation-of-state parameter during reheating as w = 0 
due to the coherent oscillation of the inflaton held, and the uncertainty estimate is only of order 
of magnitude. The analysis in Ref. [H] is based on the perturbative decay. Nonperturbative 
production of the inhaton particles, in particular localized objects called oscillons or I balls, (as 
well as light particles) is ineffective even when taking the metric preheating into account [45j . 
The dominant source of the uncertainty of iVg comes from that of the normalization of the CMB 
power spectrum. The effect of the uncertainty on n* is roughly the same magnitude as that from 
higher-order corrections in terms of N^, but the latter can be improved relatively easily {e.g. by 
numerical calculation). The higher-order corrections for the Starobinsky model are available in 
Ref. [i6] . 

When introducing the deformation parameter b, the effect of changing the shape of the poten¬ 
tial contributes to the uncertainty at most ANf, = 0(10“^). The dominant source of uncertainty 
may come from change of the reheating temperature depending on the assumptions on inflaton- 
matter couplings. If matter fields couple to gravity minimally after dimensional reduction to 
four dimensions, the inflaton-matter couplings are the usual Planck-suppressed couplings due to 
Weyl rescaling. If there are interaction vertices suppressed by A“^, the reheating temperature is 
expected to be scaled as Mp/A. Since A is related to bi as in Eq. ([6|), increases by 1.9 to 2.7 
as |5i| is varied from 10““^ to 10“^ (c/. footnote [3|) . Combining these discussions with Figs.[T]and 
[21 we conclude that we can distinguish our deformation from the original Starobinsky model if 
|6i| is of order 10“^, but |6i| of order 10“® seems difficult to distinguish taking the uncertainties 
into account. 

If there is a nonminimal coupling between a scalar {e.g., a Higgs boson) h and Ricci 

scalar R, many quanta of h can be produced by preheating with |C| ^ 5 [17]. However, if we adopt 
the maximal production case to the Standard Model Higgs, the electroweak vacuum will become 
unstable, which should be avoided. The value of ^ is constrained to be at most of order 1. Precise 
estimation of the reheating temperature in this case is subject to the uncertainties of the top mass, 
the Higgs mass, and the strong coupling constant through the renormalization group running of 
the Higgs quartic coupling. For example, if we take A = 5 x 10“^ at /r = {h?) = 10^*^ GeV 

(near the central line of the right panel of Fig. 1 in Ref. [IH]; /r is the renormalization scale), the 
preheating effect is subdominant compared to the perturbative decay as discussed above. On the 
other hand, with the same value of A at ^ {h?) = 10^^ GeV (near the edge of the Scr band 

of the right panel of Fig. 1 in Ref. @8]), we hnd AN^ ~ 0.9. Thus, once we assume existence of 
the nonminimal coupling, the small change of the observables due to the modification parameter 
b may or may not be buried in the effect of the nonminimal coupling depending on the precise 
values of the electroweak observables. 
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4 Discussion 


In this paper, we proposed a new interpretation of the Starobinsky model as a low-energy effective 
theory of a higher-dimensional theory whose characteristic energy scale is denoted by A. Com- 
pactification of extra dimensions naturally introduces a large overall factor. With the tuning of 
|6i| < 2 X 10“^ (in addition to the one for the cosmological constant), we obtain the Starobinsky 
model augmented with higher-order terms suppressed enough to be consistent with the Planck 
2015 results. Compared to taking a large parameter only in front of the B? term, taking a small 
parameter is regarded as less unnatural in the sense that it may happen by accidental cancela¬ 
tion of several contributions. We have also argued that the original version suffers from tuning 
of infinite parameters and infinite precision, whereas finite tuning of one parameter is enough in 
our framework. If the deformation is indeed of order 10“^, the model can be distinguished from 
the original Starobinsky model (6 = 0) by future observations of CMB and the 21 cm line. 

Predictions on inflationary observables studied in the previous section are consequences of 
the action but do not crucially depend on the underlying assumption (j3|). Here, we briefly 
discuss another possibility to obtain the advocated action ([S]). One of the reasons for the unnatural 
expansion of the Starobinsky model action may reside in the fact that we regard the Einstein term 
as the fundamental term, and the other terms are “secondary” in the sense that they originate 
from quantum corrections. In contrast, we can take a view that the fundamental or main term 
is the second term E? rather than R (see e.^.. Refs. |491I55| in this kind of direction). The pure 
E? theory, S = cf d^Xy/—gE?, does not have a dimensionful constant, and it is scale invariant|§ 
Inflation in this theory is in the pure de Sitter universe, and it eternally inflates. Note that the 
coefficient c of the action cannot be absorbed into R by scale transformation simply because 
the action is scale invariant, and it is legitimate to take the coefficient as a huge or minuscule 
number. The former eventually corresponds to the Starobinsky model. If the scale symmetry is 
spontaneously broken, perhaps after coupling to matter sector, then a scale A is generated. This 
will lead to the form of the action dS]). The fact that |6i| should be suppressed is unchanged, 
and we have the same predictions (fT^ . (fTT)) . (fT5]) . and (fTbl) . In this context, smallness of |6i| 
corresponds to soft breaking of the scale symmetry, and hence is technically natural. This is 
same in our scenario: After compactification, the approximate scale symmetry appears with a 
small symmetry breaking parameter. With technical naturalness combined with the explanation 
of the huge coefficient of E? in terms of extra dimensions, the form of the Starobinsky model is 
naturally understood. 

It would also be useful to discuss another possible explanation for the tuning of hi (the 
cosmological constant also). From the effective theory point of view, the first few terms in the 
low energy expansion have very small values in our model. Indeed, such a situation sometimes 

® Aspects of quadratic gravity were recently revisited in Ref. [56], and supergravity embedding of the pure 
theory was studied in Ref. m- See also the inflation scenario based on broken scale invariance in Ref. |58 |. 
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occurs in the effective action of the order parameter near the phase transition point (e.g. the 
Lifshitz point of the Nambu-Jona-Lasinio model [5911601 1 [611162] . Based on this analogy, let us 
think of the metric as some “order parameter” and explore the “phase structure” of f{R) gravity, 
using the action S = f d‘^x^—gc{—dQ — diR + E?) with c > 0. Under the standard procedure, we 
rewrite this as a theory of a canonical scalar field in the Einstein frame. The potential is bounded 
below as long as d\ — 4do > 0. The shape of the scalar potential dramatically changes depending 
on the sign of di: (i) For di = 0, the potential is constant, and its sign (dS, AdS, or Minkowski) is 
determined by the value of do- (h) For di > 0, just as in the Starobinsky model, the potential has 
a minimum as well as a flat region. Inflation is therefore realized, (iii) For di < 0, the minimum 
of the potential is at (/> = oo (runaway potential). The inflaton slow-rolls toward 4> = oo and the 
eternal inflation in the approximate dS is realized. It would be interesting if we could interpret 
inflation as a consequence of the phase transition from phase (ii) through (i) to (iii). 

Finally, let us estimate the characteristic scale A in the higher dimensional theory. Using the 
upper bound on |6i|, its lower bound is given by 

> 5 X lO'^GeV. (17) 

This scale is close to the Grand Unified Theory scale, so it is tempting to relate the higher di¬ 
mensional theory with Grand Unification. Moreover, the Planck scale Mp is not the fundamental 
scale in our viewpoint, and quantum gravity effects may appear at this scale. In the context of 
superstring theory, we can identify the scale A as the string scale mg (see the discussion after 
Eq. (HI)). Furthermore, modification of gravity at a length scale larger than the Planck length (or 
in other words, low cut-off theory) is favored from a variety of perspectives including a solution 
to the cosmological moduli problem [63] and a way to preserve global symmetries to a good 
precision [M], which is crucial, e.g., for the axion solution to the strong CP problem [SS]. These 
discussions imply that we may see the footprints of quantum gravity or string theory in the sky, 
which might simultaneously give us some hints on cosmological and particle physics problems. 
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